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The Higgs boson is the most anticipated discovery at the LHC, which can only partially explore its true 
nature. Thus one of the most compelling arguments to build a future linear collider is to investigate 
properties of the Higgs boson, especially to test the predicted linear dependence of the branching ratios 
on the mass of the final state. At a 3 TeV CLIC machine the Higgs boson production cross section is 
relatively large and allows for a precision measurement of the Higgs branching ratio to pairs of b and 
c quarks, and even to muons. The cross section times branching ratio of the decays H — > bb, H — > cc 
and H — > fi + n~ can be measured with a statistical uncertainty of approximately 0.22%, 3.2% and 15%, 
respectively. 

The electroweak symmetry breaking mechanism, called Higgs mechanism, predicts a fundamental spin-0 
particle, whose existence currently is being investigated at the LHC. The answer to the question about its 
existence is expected in 2013. The Standard Model predicted linear dependence of the branching ratios on 
the mass of the final state could be altered by non-Standard Model couplings. The LHC can deliver only 
very limited measurements of the Higgs sector, but a detailed exploration is crucial for a deep understanding 
of its nature. 

The compact linear collider (CLIC) is a proposed e + e~ collider with a maximum centre-of-mass energy 
= 3 TeV, based on a two-beam acceleration scheme [1]. In the following we present the analysis of the 
measurement of the branching ratios H — ► bb, H — > cc [2] and H — > n + [3] at such a machine. The studies 
are based on fully simulated and reconstructed samples in the CLIC_SiD [4] detector concept and take into 
account the relevant background processes as well as the main beam-related background. 



1 The CLICJSiD detector concept 

The CLICJSiD detector, used in the full simulation of samples, is based on the SiD detector concept [5] 
developed for the ILC project. It is designed for particle flow calorimetry using highly granular calorimeters 
and has been adapted [4] to the specific requirements at CLIC. 

A superconducting solenoid with an inner radius of 2.9 m provides a central magnetic field of 5T. The 
calorimeters are placed inside of the coil and consist of a 30 layer tungsten-silicon electromagnetic calorimeter 
with 3.5 x 3.5 mm 2 segmentation, followed by a tungsten-scintillator hadronic calorimeter with 75 layers in 
the barrel region and a steel-scintillator hadronic calorimeter with 60 layers in the endcaps. The read-out 
cell size in the hadronic calorimeters is 30 x 30 mm 2 . The iron return yoke outside of the coil is instrumented 
with 9 double RPC layers with 30 x 30 mm 2 read-out cells for muon identification. 

The silicon-only tracking system consists of 5 20 x 20 |a.m 2 pixel layers followed by 5 strip layers with a 
pitch of 25 um, a read-out pitch of 50 um and a length of 92 mm in the barrel region. The tracking system in 
the endcap consists of 5 strip disks with similar pitch and a stereo angle of 12°, complemented by 7 pixclatcd 
disks of 20 x 20 um 2 in the vertex and far-forward region at lower radii. 
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Table 1: List of processes considered for this analysis with their respective cross section a and the number 
of simulated events N events . The cross section takes into account the CLIC luminosity spectrum and initial 
state radiation. Cross sections marked with * include a cut on the invariant mass of the muon pair at 
generator level to lie between 100 and 140 GeV. 



2 Analysis framework and data samples 

The physical processes are generated with the Whizard [6, 7] event generator, with fragmentation and 
hadronization done by Pythia [8]. The full simulation and reconstruction is performed in the software 
framework of the CLIC.SiD detector concept. 

The event simulation is performed using SLIC [9], a wrapper for Geant4 [10], while the reconstruction is 
done by lesim and PandoraPFA packages. We assume a total accumulated luminosty of 2 ab -1 , corresponding 
to 4 years of data taking at the nominal machine parameters. Table 1 lists the physics processes that were 
taken into account in the analyses, together with their cross sections and the number of simulated events. 

The dominant Higgs boson production channel at 3TeV is the WW fusion channel e+e - — > Hvv with 
a cross section of grvd = 420 fb. The main background for all channels is the Z boson production via WW 
fusion, which has similar kinematics as the signal processes. 

Beamstrahlung effects on the luminosity spectrum as well as initial and final state radiation are taken into 
account. For the default configuration of a 3TcV CLIC [1], 3.2 yy — > hadrons events per bunch crossing are 
expected on average. With a spacing of 0.5 ns between bunches, these necessarily pile up in the subdetectors, 
for which we assume integration times of 10 ns, except for the barrel hadronic calorimeter, which has 
an integration time of 100 ns. To approximate the CLIC beam structure and background conditions, the 
equivalent of 60 bunch crossing of YY — > hadrons events were mixed with every simulated event. In the 
H — > /i+yu - analysis, only the signal sample was mixed with events from yy — > hadrons background. 

For the processes involving jets in the final state, fragmentation products of the hadronic systems are 
forced to two jets using the exclusive k t algorithm of the Fast Jet package [11], where the parameter R is set 
to 0.7. The LCFI vertexing package [12] is used to identify jets according to their quark content as b, c and 
light quarks and computes the corresponding jet flavour tag values. 

The event classification in H — > bb and H — > cc analyses is based on the open source Fast Artificial Neural 
Network (FANN) package [13]. FANN was modified to account for event weights during the neural network 
training. The event classification in the H -4- ^+ [i~ analysis is done using the boosted decision tree (BDT) 
classifier implemented in the TMVA package [14]. 
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Figure 1: In the left plot, the mis-tag rate in the CLIC_SiD detector for charm (blue) and light (green) jets 
as a function of the b-tag efficiency is shown. The right plot shows the mis-tag rate for bottom (red) and 
light (green) jets as a function of the c-tag efficiency. The mean pt of the jets is 70 GeV while the mean 
energy is ~ 130 GeV. 

3 Measurement of H — > bb and H — > cc 

The measurement of the H — > bb and H — > cc decays requires resolution of secondary vertices from the 
primary vertex and is thus an important benchmark of the vertex tracking detector design. 

3.1 Jet flavour tagging 

The flavour identification package developed by the LCFI [12] collaboration consists of a topological ver- 
tex finder ZVTOP, which reconstructs secondary interactions, and a multivariate classifier which combines 
several jet-related variables to tag bottom, charm, and light quark jets. Displaced vertices are the most sig- 
nificant characteristic of b quark decays. A combination of several vertex-related variables, complemented 
by additional track-related variables, form an input for the tagging classifier. A detailed description of the 
variables and the procedure are given in [12]. 

The probability to tag a jet with a false flavour, the so called mis-tag rate, is used to assess the performance 
of the flavour tagging package. Figure 1 (left) shows the mis-tag rate for c-jets (blue line) and light jets 
(green line) as b-jets versus the b-tag efficiency, while Figure 1 (right) shows the mis-tag rate for b-jets 
(red line) and light jets (green line) as c-jets versus the c-tag efficiency. The presence of yy backgrounds is 
found to reduce the flavour tagging performance, although the effect is not dramatic. The degradation of 
the flavour tag performance, shown in Figure 1, has two sources: the flavour tag degradation itself plus a 
degradation of the jet quality due to a more difficult jet finding. For instance, at the b-tag efficiency of 70% 
the mis-tag rate for c-jets (light jets) drops from 4.3% (0.19%) w/o overlay to 6.8% (0.33%) with overlay. 

3.2 Results 

The basic event selection requires two jets in each event. Apart from this selection, no further cuts are 
explicitly imposed and a number of relevant variables is given to a neural net for the subsequent multivariate 
analysis. The invariant mass of the jet pair is the major discriminant between decays of Higgs and of Z 
bosons. It is used in a event classification neural network, together with the output of the b-flavor tagging 
network and the following variables: 

• The maximum of the absolute values of jet pseudorapidities. 
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Figure 2: Statistical uncertainty of the measurement of cross section times branching ratio versus selection 
efficiency of the neural network. The neural network was trained to identify H — > bb decays from di-jet 
backgrounds including H — > cc (left). The neural network was trained on H — > cc as signal and di-jets 
backgrounds including H — > bb (right). 

• The sum of the remaining LCFI jet flavour tag values, i.e. c(udsb), c(b)-tags and b(uds)-tag 1 . 

• i?,^, the distance of jets in the t\ — (f> plane. 

• The sum of jet energies. 

• The total number of leptons in an event. 

• The total number of photons in an event. 

• Acoplanarity of jets. 

Two neural nets were trained to separate either the H — > bb or the H — > cc signals from background 
samples accounting for event weights. Thus the amount of the information about the signal, compared to 
the background, was proportional to its natural contribution. Such a solution delivers optimal results. It 
is more appropriate than, for instance, choosing the same number of signal and background events with no 
weights, or, training according to arbitrary sizes of the generated samples. 

The neural network selection efficiency versus the statistical uncertainty on the measurement is shown in 
Figure 2 for the two neural networks that were trained on H — > bb and H — > cc as signal, respectively. The 
optimal selection is at the local minimum of the curve, at a selection efficiency of 55% for H — > bb with a 
sample purity of 65%, and a selection efficiency of 15% for H — > cc corresponding to a sample purity of 24%. 
These values reflect the fact that b-jets can be distinguished from c-jets with high purity, while incompletely 
reconstructed b-jets and light jets make up a large fraction of the background to c-jet selection. Using the 
output of the reconstruction algorithms in neural networks leads to the minimal statistical uncertainty on 
the measurement at the eventual cost of an increased dependence on systematic effects. We assume that 
with sufficient experience at the running machine, the systematic variations are well enough understood so 
that the systematic uncertainties are comparable to the statistical uncertainties of the H — > cc channel and 
dominate in the H — > bb channel. 

The resulting statistical H — > bb cross section uncertainty amounts to 0.22 % while preserving meaningful 
values of both the sample purity (65.4 %) and of the signal selection efficiency (54.6 %). The H — > cc channel 
is more difficult to separate from the background and the statistical cross section uncertainty is 3.24% with 
a signal selection efficiency of 15.2%. 

lr The notion indicates which flavour is tagged against which set of other flavours. For instance, c(b) is the c-flavour tagged 
against the b-flavour only, while remaining (uds) flavours are not used during the neural net training. 
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Figure 3: Muon reconstruction efficiency for the signal sample with and without YY —> hadrons pile-up. 



4 Measurement of H — > 

The measurement of the rare decay H — > requires high luminosity operation and sets stringent limits 

on the momentum resolution of the tracking detectors. The branching ratio of the decay of a Standard 
Model Higgs boson to a pair of muons is important as the lower end of the accessible decays and defines the 
endpoint of the test of the predicted linear dependence of the branching ratios to the mass of the final state 
particles. 

The events are selected by requiring two reconstructed muons, each with a transverse momentum of at 
least 5 GeV. In this note, the most energetic muon is referred to as Ui and the second most energetic muon 
is referred to as \\.2- In addition, the invariant mass of the two muons Af(/i + /i~) is required to lie between 
105 GeV and 135 GeV. 

The muon reconstruction efficiency is shown in Figure 3 (left). The beam induced background from 
YY — > hadrons leads to a small deterioration of the muon reconstruction efficiency. The average muon 
reconstruction efficiency for polar angles greater than 10° is 98.4% with this background compared to 99.6% 
without. The total reconstruction efficiency of the signal sample, requiring two reconstructed muons with 
an invariant mass between 105 GeV and 135 GeV is 72% in the presence of background. 

The event classification is done using boosted decision tree classifier implemented in TMVA [14]. The 
BDT is trained to separate the n + |a. _ vv signal events from the \i + \i~e + e~ background. The |J. + u~, t + t~ 
and t + t - vv samples are not used in the training of the BDT, but are effectively removed by the classifier 
nevertheless. 

The variables used for the event selection by the BDT are: 

• The visible energy excluding the two reconstructed muons E v - ls . 

• The scalar sum of the transverse momenta of the two muons pt(M-i) +Pt((J-2)- 

• The helicity angle cos 9* = \p(^)\^( il +^ L -)\ > where ff is the momentum in the rest frame of the 
di-muon system. Since the two muons are back-to-back in the rest frame of the di-muon system there 
is no additional information to be gained from calculating a similar angle for u.2- 

• The relativistic velocity of the di-muon system P(/i + /i~), where (3 = 

• The transverse momentum of the di-muon system px(A* + / i ~)- 

• The polar angle of the di-muon system 9(/i + fi~). 

The major discriminant is the visible energy whenever there is an electron within the detector acceptance. 
Otherwise the background can be rejected by the transverse momentum of the di-muon system or the sum 
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of the two individual transverse momenta. Figure 3 (right) shows the Higgs peak in the invariant mass 
distribution after the event selection. 

The dominant background from e+e~ — > e + e~^ + /i~ events, is effectively reduced by forward electron 
tagging. While the forward calorimeters were not part of the full detector simulation, assuming a tagging 
efficiency of 95% down to an angle of 40 mrad for electrons of several hundred GeV to over one TeV is a 
conservative estimate, even in the presence of yy —> hadrons background. It is found that Bhabha events 
prevent further rejection of this background at lower angles. The results quoted are based on a ad-hoc 
rejection of 95% of the electrons in the Luminosity Calorimeter. 



4.1 Invariant mass fit 

The distribution of the invariant mass in the H — > n + n~ sample has a tail towards lower masses because of 
final state radiation. The shape can be described best by two half Gaussian distributions with an exponential 
tail. Together with the mean value this results in five free parameters in the fitted function, which can be 
written as 

' -(.- ron) 2 



f(x) = n < 



, x > mo 



where mo is the mean of both Gaussian distributions, ul and <jr are the widths, and oil and an are the tail 
parameters of the left and the right Gaussian distribution, respectively; n is a normalization parameter. The 
background is well described by an exponential parameterization, obtained from a background-only sample. 

The number of signal events is obtained from a maximum likelihood fit to the sample containing signal 
plus background after the event selection. 

The average muon momentum resolution of the fully simulated sample is 4 x 10 5 GeV -1 corresponding to 
a statistical uncertainty of 23% without the forward electron tagging. If the background from e + e~ — > |i + u~ 
can be reduced using tagging of electrons down to an angle of 40 mrad with an efficiency of 95%, the cross 
section times branching ratio can be measured to a precision of 15%. 



5 Summary 

The sensitivity to the decay branching ratios of a neutral 120 GeV Standard Model Higgs boson to bottom 
and charm quarks and to muons has been studied at the CLIC centre-of-mass energy of y/s = 3 TeV and 
integrated luminosity of 2ab _1 . The analysis is based on full simulation and realistic event reconstruction 
in the CLICLSiD detector. We have demonstrated the feasibility of such measurements and estimated their 
statistical uncertainty. 

For the measurement of Higgs decays to quarks, 0.22% and 3.2% statistical uncertainty can be achieved 
for the decays H — > bb and H — > cc, respectively. This includes the effect of background from yy — > hadrons 
on the flavor tagging. 

For the rare decay H — > ^ + ^ 1 the cross section times branching ratio can be measured to a precision of 
15% if the background from e + e~ — > can be reduced using tagging of electrons down to an angle of 

40 mrad with an efficiency of 95%. The effect of yy — > hadrons has been taken into account conservatively 
by only including it in the signal sample and thus reducing its reconstruction efficiency. 

From experience of the LEP experiments one can assume that the systematic uncertainties related to 
detector effects are of the order of 1% or less. For the measurement of OzQ-^+n- at LEP the systematic 
uncertainty was between 0.1 and 0.4%, depending on the experiment. Thus we expect that the systematic 
uncertainty of the H — > fi + \£~ analysis will be negligible compared to the statistical uncertainty, the un- 
certainty of H — »■ bb analysis will be dominated by the systematic and theoretical uncertainties and in the 
H — > cc analysis the uncertainty sources will contribute comparably. 
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